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ABSTRACT 

Efficient  scheduling  of  jobs  for  computer  systoms  Is  e problem  of 
continuing  concern.  Tho  applicability  of  scheduling  methodology  described 

in  the  operations  research  literature  is  severely  restricted  by  the 

dimensionality  of  job  characteristics,  tho  number  of  distinct  rosourco 
types  comprising  a computer  system,  tho  non-deterministic  nature  of  tho 
system  due  to  both  Interprocess  Interaction  and  contention,  and  the 

existence  of  a multitude  of  constraints  offocting  job  Initiation  times,  job 
completion  times,  and  job  interactions.  In  view  of  the  largo  number  of 
issues  which  must  be  considered  in  job  scheduling,  a heuristi-  approach 

seems  appropriate.  This  paper  describes  an  Initial  implementation  of  such 
an  approach  based  upon  a fast,  analytically  driven,  performance  prediction 

tool. 


1.  INTRODUCTION 

Job  scheduling  Is  an  issue  of  continuing  concern  to  computer  center 
management.  This  concern  reflects,  in  IargB  part,  the  inapplicability  of 
the  substantial  body  of  work  on  scheduling  existing  within  tho  operations 
research  literature  [BAKEK  71],  [BELLL  70],  [CONWR  67],  [SAHNV  72].  This 
Inapplicability  is  due  to  the  fact  that  most  of  tho  operations  rosearch 
literature  on  scheduling  Is  concerned  with  either  deterministic  scheduling, 
which  Is  based  on  mathematical  programming  techniques,  or  probabilistic 
scheduling,  which  Is  based  on  quouing  theory.  However,  a computer  system 
contains  both  deterministic  and  probabilistic  components.  Further,  some 

devices,  e.g.,  tapes  and  disks,  may  alter' iate  Fotween  shared  and  serially 
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Projects  Agency  under  Contract  No.  DAHC15  72  C 0308,  ARPA  Order  No.  2223, 
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(NR  049-311). 
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author  and  should  not  be  Interpreted  as  necessarily  representing  the 
official  policies,  either  expressed  or  implied,  of  tho  Advanced  Research 
Projects  Agency,  tho  Office  of  Naval  Rosearch,  or  tho  U.S.  Government. 
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reusable  status.  Hence  the  device  characteristics  can  h3ve  deterministic 

status  changes  and  probabilistic  servico  times.  it  follows  that  neither 
deterministic  nor  probabilistic  approaches  can  be  used  in  Isolation,  and 
attempts  to  do  so  usually  result  in  information  which  provides  a groat  deal 
of  insight  but  little  practical  guidance  in  constructing  actual  schedules 
for  the  execution  of  jobs  on  computer  systems. 

The  evident  Inability  of  analytic  approaches  to  handle  the  complexity 
of  computer  system  scheduling  suggests  the  desirability  of  developing 
heuristic  techniques  for  job  scheduling.  Such  techniques  have  been  used  In 
a wide  variety  of  contexts  within  computer  engineering;  a notable  success 
Is  their  utilization  in  developing  ’good’  topologies  for  ARPANET 

[FRANH  70,  72]. 

Development  of  a heuristic  approach  to  computor  systems  scheduling 
requires: 

1.  Prediction  of  the  performance  of  a given  computer  system 

processing  a specified  workload  in  accord  with  a given  schedule, 

2.  Identification  of  a means  for  comparing  two  schedules  and 

Improving  a given  schedule,  and 

3.  Determination  of  when  the  Iteration  cycle  implicitly  defined 

by  (1)  and  (2)  should  stop. 

A fundamental  difficulty  in  developing  computor  system  schedules  has 
been  the  cost  implicit  in  (1).  That  is,  the  relative  desirability  of  the 
schedule  Is  related  to  device  utilizations  and  delays.  However,  simulators 

which  are  sufficiently  detailed  to  provide  useful  information  on  device 
delays  and  utilizations  usually  exhibit  execution  times  which  tend  to  be  in 
the  lower  end  of  the  range  1-10  times  faster  than  real  time.  Thus, 

heuristic  schedule  generation  based  on  sin  ulators  is  rendered  infeasible. 

In  [KIMBS  74A],  a very  fast  analytically  driven  approach  to  computer 
system  performance  prediction  for  production  batch  jobs  has  been  described. 
This  tool  seems  appropriate  to  (1)  and,  as  will  be  discussed  later,  the 
class  of  jobs  for  which  it  is  applicable  appears  to  be  the  class  of  jobs 
for  which  the  development  of  schedules  is  most  appropriate.  Thus,  the 

basic  requirement  for  developing  a heuristic  approach  to  schodullng  Is  at 
hand.  In  the  remainder  of  this  paper,  wo  shall  identify  a means 

forcomparing  schedules  and  discuss  an  initial  heuristic  approach.  An 
example  of  the  output  from  this  approach  is  given  in  Section  4. 

This  paper  Is  not  intended  to  be  a contribution  to  either  scheduling 

theory,  per  se,  or  to  computer  system  modeling.  Rather,  It  Is  Intended  to 

demonstrate  that  the  proper  combination  of  theoretical  knowledge  from  each 
of  these  two  areas  can  provide  a practical  means  of  solving  a problem  of 
continuing  concern  to  computer  center  managers  In  a cost  effective  manner. 
Further,  without  such  a theoretically  based,  practically  oriented  approach, 
a usable  solution  to  this  problem  appears  remote. 


2.  COMPUTER  SYSTEM  SCHEDULE  COMPONENTS 

In  this  section  we  define  what  Is  meant  by  computer  system  schedule, 
identify  a class  of  jobs  most  appropriate  to  scheduling,  describe 
constraints  to  be  observed  In  constructing  a schedule,  and  Identify  8 
quantitative  means  for  schedule  comparison. 


Job  Types 

The  jobs  to  be  processed  by  a computer  systom  can  be  divided  Into  four 
categories,  depending  upon  whethor  or  not  timos  of  arrival  and  rosource 
requirements  are  known.  External  scheduling  Is  most  appropriate  for  jobs 
whose  arrival  times  and  resource  requirements  are  known.  In  the  remainder 
of  this  paper,  wo  are  concerned  with  this  class  of  jobs  formed  the 
production  batch  jobs. 


Schedule  Definition 

Label  the  collection  of  jobs  to  bo  processed  during  a shift  1,...,N. 
A schedule  Is  a permutation  P(1),...,P(N),  of  these  Indices  describing  the 
sequence  Ir  which  jobs  are  to  be  initiated.  Jobs  will  bo  Initiated  in  the 
(left  to  right)  order  Indicated  suojoct  to:  (1)  job  P(l)  Is  Initiated 

first,  (2)  once  a job  is  initiated,  that  job  remains  resident  In  the  systom 
until  It  is  completed,  l.e.  its  execution  Is  not  involuntarily  suspendod, 

(3)  If  jobs  P<  1 ),...,P<j)  have  been  initiated,  the  next  job  to  bo  Initiated 
will  be  job  P(j+1),  and  (4)  If  jobs  P(l),...,P(j-l)  havo  been  Initiated, 
Job  P(j)  will  be  initiated  Immediately  onco  all  constraints  affecting  Its 
Initiation  have  been  satisfied  (cf.  the  subsection  below  discussing 
constraints). 


Schedule  Constraints 

Any  permutation  P(1),...,P(N)  constitutes  a schedule.  A schedule 

which  satisfies  all  constraints  is  termed  feasible.  For  batch  production 
Orientod  schedules,  four  major  types  of  constraints  exist:  priorities, 

deadlines,  precedence  relations  and  resource  requirements.  For  simplicity 

we  shall  assume  that  all  jobs  have  the  same  priority  and  consist  of  one 

jobstep.  The  requirod  extensions  to  Incorporate  muitiplo  (operational) 
priority  classes  and  jobs  consisting  of  multiple  jobstops  will  bo  evident 
once  the  general  approach  has  been  described.  We  also  assume  that  if  o 

precedence  relation  exists  betwoen  jobs  P(j)  and  F'(k),  j < k,  then  job  P(j) 
must  be  completed  before  job  P(k)  can  be  inflated.  Finally,  we  shall 

assume  that  setup/teardown  timos  aro  negligible. 


Comparing  Computer  System  Schedules 


For  a production  batch  computer  systom,  the  natural  objeciivo  function 

is  to  minimize  usage  of  sys*jm  resources  subject  to  tho  constraint  that  aii 

deadlines  are  met.  (Njte  that  this  objective  function  also  permits 
incorporation  of  tur  ground  time  limitations  through  development  of 
artificial  deadlines  co  responding  to  the  time  of  arrival  plir  acceptable 
mean  turnaround  thie.)  7o  formulate  a quan‘ita.ive  analogue  of  this 
concept,  we  inhoduce  the  concept  of  the  shift  system  reward  funr>!on 
(SSRF).  This  requires  introduction  of  two  prior  definiiiois:  event  and 

time  segment. 

DEFINITION.  An  event  is  a point  in  time  at  which  either  a job  initiation 
or  job  termination  occurs.  A time  segment  Is  the  time  betwoen  two 

successive  events. 

A schedule  containing  N jobs  will  have  precisely  2N  ovents  and  2N  - 1 
time  segments.  Tho  composition  of  the  mix  is  constant  ever  a time  segment 
and  thus  It  is  appropriate  to  define  the  time  segment  system  reward 

function  (TSSRF)  for  a given  time  segment  as: 

TSSRF  - U(P)C(P)  + U(M)C(M)  + U(S)C(S)  + U(D)C(D) 

C(P)  + C(M)  + C(D)  + C(S>  + C(U) 

where  P,M,D,S,U  denote  processor,  memory,  secondary  storage,  data  paths  and 
unit  record  equipment  respectively,  U(*)  donotos  the  utilization  of  the 
device  ovor  the  time  segment  and  C(*)  denotes  the  monthly  rental  cost  of 
the  device. 

Our  objective  function  can  now  be  formally  stated  as: 

MIN  SSRF  (- 2 TSSRF) 
subject  to: 

NO  MISSED  DEADLINES, 

where  the  sum  oxtends  over  all  the  time  segments  comprising  the  shift. 

Noto  that  this  objective  function  attempts  to  minimize  the  total  cost 
weighted  utilization  of  tho  system  by  the  collection  of  production  batch 
jobs  which  must  be  processed  during  the  shift.  Thus  It  tends  to  ensure 
that  in  executing  these  jobs,  the  system  will  be  mad9  maximally  available 
(in  a cost  sense)  to  the  jobs  in  the  other  categories,  subject  to  the 
requirement  that  all  deadlines  be  met. 

in  common  with  almost  ail  optimization  techniques,  this  approach  does 
not  assure  maximal  availability  to  other  job  classes  since  it  is  possible 
that  a schedule  may  be  found  for  which  some  device  Is  effectively  operated 
at  tho  edge  of  saturation.  However,  as  will  be  evident  from  the  opproach, 
this  can  be  avoided  by  a simple  modification  of  the  heuristic  used  to  limit 
the  maximum  utilization  for  any  given  device. 
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3.  SCHEOULE  GENERATION 


I 


Heuristic 
Interaction  of 


generation  of  a schedule  requlros  characterization  of  the 
a job  with  a computer  system.  For  production  batch  Jobs,  o 


synthetic  module  [HAM'R  73],  [SREEK  74]  charactorizahon  proves 


satisfactory. 


Generation  of  an  appropriate  schedule  now  proceeds  in  two  phases: 
construction  of  an  Initial  schedule  and  Improvement  of  a given  schedule. 


initial  Schedule  Generation 

Generation  of  an  initial,  seizable  schedule  proves  difficult  since  tho 
r.chedule  determines  and  is  determined  by  tho  execution  times  of  the  Jobs 

being  processed.  This  impasse  is  circumvented  by  Initially  assigning  a 
(conservative)  deterministic  processing  time  to  a Job.  Using  this 
deterministic  estimator,  a sorting  technique  can  then  be  used  to  obtain  a 
feasiblo  schedule,  l.o.,  a schedule  needing  deadline  requirements  and 

satisfying  precedence  constraints.  Treesort  [KNUTD  $8]  appears  most 

appropriate  since,  in  offect,  the  calculations  are  analogous  to  those  used 
in  PERT  Scheduling.  Since  the  deterministic  estimators  overestimate  job 

processing  times,  the  initial  schedule  generator  may  fall  to  find  e 
feasible  schedule.  In  this  case,  one  may  either  torminato  the  heuristic 

search,  or  follow  the  standard  approach  of  entering  the  schedule 

Improvement  routino. 


Schedule  Improvement  Routine 

Thu  development  of  heuristics  for  achieving  an  improved  schedule  from 
a given  schedule  in  the  context  which  has  been  described  has  rocolved  very 
little  study.  Accordingly,  as  a first  cut  at  the  probiom,  a very  simple 
schedule  Improvement  routine  has  been  implemented.  This  routine  In  effect 
constructs  the  graph  of  the  TSSRF  plotted  against  the  time  segment  number 
and  then  attempts  to  level  the  ’hilis’  by  filling  the  ’valleys.’  It  is 
widely  known  that  a transposition  technique  such  as  described  may  fall  to 
find  an  optimal  schedule.  However,  there  is  reason  to  think  that  it  Is  a 
fairly  reasonable  technique  with  a reasonable  likelihood  of  finding  a 
’good’  schedule  [LINS  65].  Some  results  from  tho  application  of  this 

technique  a'\e  given  in  the  following  soction. 


Stopping  tho  Search 

Any  iterative  optimization  technique  requires  the  existence  of  a 
stopping  procedure.  At  tho  present  stato  of  development  of  the  technique 
Just  described,  such  procedures  are  dearly  inappropriate.  Thus,  we  havo 
chosen  to  Implement  a very  simple  ‘ochnlque  which,  In  offect,  docides  to 
stop  when  either  a cortf  in  number  of  iterations  have  been  roachod  or  whor, 
tho  dlfforor.ee  between  the  maximum  and  minimum  TSSRF  falls  within  some 
specified  tolerance  limit.  Better  procedures  are  needed. 
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4.  AN  EXAMPLE 

An  evaluation  of  a heuristic  simulate  should  be  conducted  along  two 
dimensions:  detection  and  agility.  ^ialitatively,  the  ability  to  detect 
undesirable  scheduling  characteristic?  through  examination  of  the  time 
sequence  of  TSSRFs  appears  to  be  satisfactory.  Space  limitations  preclude 
presentation  of  a series  of  examples  to  support  this  assertion. 

The  agility,  in  a general  environment,  of  the  schedule  Improvement 
routine  remains  open  to  question.  It  is  relatively  easy  to  construct 
examples  in  which  the  agility  appears  to  be  either  good  or  bad,  depending 
upon  the  characteristics  of  the  example.  Significant  refinement  In  the 
heuristics  is  clearly  desirable.  The  utility  of  such  refinements  is 
evidenced  by  the  ability  of  an  Interactive,  man-machine  version  to  produce 
good  schedules.  In  this  approach,  the  Initial  schedjle  generator  Is  used 
to  obtain  the  first  schedule,  and  the  ’man’  is  thereafter  responsible  for 
generation  of  heuristics  and  determination  of  when  to  stop. 

The  output  of  the  scheduler  Is  divided  among  five  output  filos 
representing  five  different  categories  of  Information,  Space  limitations 
therefore  preclude  a complete  description  of  the  output  for  even  a simple 
example.  Accordingly,  we  have  chosen  to  describe  the  two  major  Input  files 
(job  characteristics,  system  characteristics)  to  the  scheduler  for  a simple 
five  job  example.  In  this  example,  the  resource  requirements  of  the  jobs 
are  Identical  and  the  differences  are  reflected  only  In  the  duo  dates  and 
times  of  arrival  which  were  chosen  to  render  development  of  a feasible 
schedule  impossible.  Table  3 presents  example  time  segment  statistics  and 
Table  A provides  a comparison  of  the  characteristics  of  the  schedule 
developed  by  the  initial  schedule  generator  and  the  revised  schedule  after 
one  pass  with  the  schedule  Improvement  routine.  The  capability  of  the 
scheduler  to  handle  dedicated  disks  and  tapes  was  not  demonstrated  due  to 
space  limitations.  It  is  hoped  that  examination  of  this  information  will 
persuade  the  reader  of  the  potential  merits  of  a heuristic  approach.  A 
forthcoming  technical  .report  will  provide  a more  substantive  basis  for  this 
examination. 
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TABLE  1 

SYSTEM  CHARACTERISTICS 

Number  of  Drums 

2 

Number  of  Pages/Track 

6 

Drum  Rotation  Time 

35ms. 

Number  of  Disks 

8 

Number  of  Pages/Track 

3 

Disk  Rotation  Time 

25ms. 

Minimum  Seek  Time 

10ms. 

Maximum  Seek  Time 

60ms. 

Average  Seek  Time 

34ms. 

Cost  Por  Unit  Time  of 

CPU 

.2 

Core 

1.9 

Drum 

.5 

Disk 

.03 

TABLE  2 

JOB  CHARACTERISTICS 

Time  of  Arrival 

*** 

Due  Date 

*** 

CPU  Time  Required 

40  s. 

CPU  Time/I/O  Operation 

40ms. 

Primary  Memory  Required 

20pp. 

Number  of  I/O  Devices  Accessed 

10 

Drum  Access  Characteristics 

Probability  of  Access  to  Drum  1, 1-1,2 

.333 

Pages  Transferred  Per  Access  to  Drum  1, 1-1,2 

1 

Average  Latency  Per  Access  to  Drum  1, 1-1,2 

17ms. 

Probability  of  Access  to  Disk  1, 1-1,. ..,8 

.042 

Pages  Transferred  Por  Access  to  Disk  1, 1-1,. ..,8 

2 

Average  Number  of  Cylinders 

Per  Seek  for  Disk  1, 1-1,. ..,8 

100 

***  Indicates  statistics  varied  on  a por  job  basis. 


TABLE  3 
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REPRESENTATIVE  TIME  SEGMENT  STATISTICS 


Time  Segment  Beginning  79.75  6. 

Time  Segment  End  178.79  s. 

Degree  of  Multiprogramming  2 

Utilizations 

CPU  .51 

Core  .AO 

Drum  .13 

Disk  .04 

Systom  Reward  Function  .195 


TABLE  4 

COMPARISON  OF  OUTPUT  FROM  INITIAL  SCHEDULE 
GENERATOR  AND  SCHEDULE  IMPROVEMENT  ROUTINE 
{ONE  PASS) 


Schedule  Begin 

0.0 

0.0 

Schedule  End 

328.27 

277.83 

Total  Number  Jobs  Processed 

5 

5 

Average  Degree  of  Multi- 

programming 

1.25 

1.70 

Average  I/O  Time 

36ms. 

38ms. 

Average  Processor  Wait  Between 
Two  Successive  I/O 

Operations 

6ms. 

14ms. 

SRF 

.12 

.16 

Utilizations 

Core 

.52 

.50 

Primary  Memory 

.26 

.34 

Drum 

.04 

09 

Disk 

.01 

.03 

5.  CONCLUDING  REMARKS 

The  preceding  discussion  Is  Intended  to  demonstrate  that  through 
proper  application  and  combination  of  theoretical  results,  an  approach  to 
external  job  scheduling  can  be  achieved  which  is  of  practical  significance. 
Cloarly,  the  cost  of  this  approach  depends  upon  the  tool  used  to  prodict 
systom  ^.rformance  for  a given  schedule,  i.e.  the  execution  speed  of  ASIM. 
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In  Its  present  version,  speeds  significantly  faster  than  those  otherwise 
available  are  achieved  (for  production  batch  jobs).  Inspection  of  this 
version  shows  that  It  is  highly  likely  that  a revised  version  with  an 
execution  speed  which  Is  approximately  one  quarter  of  the  current  version 
can  be  achieved. 

The  heuristic  approach  to  scheduling  as  described  uses  existing 
theoretical  scheduling  insights  in  only  a limited  manner.  Clearly,  It  Is 
desirable  to  examine  the  applicability  of  shortest  Job,  smallest  job,  and 
first  fit  types  of  heuristics. 
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